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Simultaneous Robot-World and Hand-Eye Calibration 

Fadi Dornaika and Radu Horaud, Member, IEEE 


Abstract —Recently, Zhuang, Roth, Sc Sudhakar [1] proposed a 
method that allows simultaneous computation of the rigid trans¬ 
formations from world frame to robot base frame and from hand 
frame to camera frame. Their method attempts to solve a ho¬ 
mogeneous matrix equation of the form AX — ZB. They use 
quaternions to derive explicit linear solutions for X and Z. In 
this short paper, we present two new solutions that attempt to 
solve the homogeneous matrix equation mentioned above: (i) a 
closed-form method which uses quaternion algebra and a positive 
quadratic error function associated with this representation and 
(ii) a method based on non-linear constrained minimization and 
which simultaneously solves for rotations and translations. These 
results may be useful to other problems that can be formulated 
in the same mathematical form. We perform a sensitivity anal¬ 
ysis for both our two methods and the linear method developed 
by Zhuang et al. [1]. This analysis allows the comparison of the 
three methods. In the light of this comparison the non-linear 
optimization method, which solves for rotations and translations 
simultaneously, seems to be the most stable one with respect to 
noise and to measurement errors. 

Keywords — hand/eye calibration, robot/world calibration, 
quaternion algebra. 

I. Introduction 

In order to use a gripper-mounted sensor (such as a cam¬ 
era) for a robot task, the position and orientation of the sensor 
frame with respect to the gripper frame must be known. The 
problem of determining this relationship is referred to as the 
hand-eye calibration problem. One can find this relationship 
by moving the robot and observing the resulting motion of the 
sensor. This calibration problem yields a homogeneous matrix 
equation of the form AX = XB. Several closed-form solutions 
were proposed in the past to solve for X [2], [3], [4], [5] as well 
as a non-linear optimization method [6]. 

Recently, Zhuang et al. [1] proposed a method that allows 
the simultaneous estimation of both the transformations from 
the world-centered frame to the robot-base frame and from the 
gripper frame to camera frame. The identification problem is 
cast into the problem of solving a system of homogeneous matrix 
equations of the form AX = ZB, where X is the gripper-to- 
camera rigid transformation and Z is the robot-to-world rigid 
transformation. Quaternion algebra is applied to derive explicit 
linear solutions for X and Z. 

The mathematical framework of AX = ZB allows one to 
solve for at least two types of robotic configurations. These 
configurations are shown on Figure 1 and Figure 2. It is worth¬ 
while to notice that matrices X and Z can be estimated either 
sequentially or simultaneously. Therefore two approaches are 
possible: 

1. X is estimated first using any hand-eye (or camera- 
gripper) calibration method and Z is estimated by solving 
the equation AX = ZB, or 

2. X and Z are simultaneously estimated by solving AX = 
ZB where both X and Z are unknowns. 
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Fig. 1. Robot /world (Z) and hand/eye (X) calibration. The camera is 
mounted onto the gripper and camera motions are determined using 
a calibration pattern. The world frame is the frame of the calibration 
pattern. 



Fig. 2. Robot/eye (Z) and hand/tool (X) calibration. The tool is 
mounted onto the gripper and tool motions are determined by ob¬ 
serving tool feature points with a camera. The world frame is, in this 
case, identical with the camera frame. 

This paper describes both a closed-form solution and a non¬ 
linear solution for the system of matrix equations AX = ZB. 
These solutions solve for two rotations and two translations that 
are associated with the matrices X and Z. Likewise the lin¬ 
ear method [1] the closed-form and non-linear methods yield a 
unique solution provided that the robot performs two motions 
with distinct rotation axes. The main differences between the 
linear method and the closed-form method introduced in this 
paper are the followings: 

• The linear method first solves linearly for the components 
of two quaternions and second it normalizes these quater¬ 
nions such that they represent rotations. The closed-form 
method solves directly for two unit quaternions and hence 
the constraint that these quaternions must represent two 
rotations is built in the resolution method. 

• The linear method is not feasible for some special configu¬ 
rations (see [1] and below). We show that the closed-form 
method remains feasible for such special configurations. 

We perform a sensitivity analysis for both our methods and 
for the linear method of Zhuang et al. [1]. This analysis allows 
the comparison of the three methods. In the light of both sim¬ 
ulated and real experiments, it appears that the non-linear op¬ 
timization method, which solves for rotations and translations 
simultaneously, performs better than the closed-form method 



which in turn performs slightly better than the linear method. 

The remainder of this paper is organized as follows. Section II 
briefly recalls the problem formulation and presents the linear 
solution suggested by Zhuang et al. [1], The closed-form and 
non-linear methods are described in Section III. Section IV com¬ 
pares the three methods through a sensitivity analysis. Finally, 
Section V describes some experimental results and Section VI 
provides a short discussion. 

II. Problem formulation 

We consider an arbitrary position of the robotic system (refer 
to Figures 1 and 2). From these figures we can write: 

AX = ZB (1) 

In the particular case of a camera, the matrix A is obtained 
by calibrating the camera with respect to a fixed calibrating ob¬ 
ject and its associated frame, called the calibration frame. The 
matrix B is computed using the manipulator’s forward kine¬ 
matics whose parameters are supposed to be known (see [7] for 
an approach which attempts to estimate simultaneously these 
kinematic parameters and the hand-eye transformation). Let 
Ra, Rb, R\ , and R z be the respective 3x3 rotation matrices 
of A, B, X, and Z, and let £a, £b, tx , and tz be the respective 
3x1 translational vectors. Equation (1) can then be written as: 
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and one may easily decompose this equation into a rotation 
equation and a position equation: 

RaRjv = RzRb (2) 

Ra tx + tA = Rz tB + tz (3) 

Equation (3) is a linear equation in tx and tz if Rz is known. 
A. Linear solution 

This solution was suggested in [1]. Let q A , q B , q x , and q z 
be unit quaternions that correspond to the rotation matrices 
Ra, Rb, Rjt, and Rz [8]. Since quaternions can be written 
as a combination of a scalar and a 3-vector, we have q' A = 
[«o,o r ] and so forth. The matrix equation RaRi = RzRb 
is equivalent to the following quaternion equation: 

9 a * Qx = Qz * Qb ( 4 ) 

Expanding eq. (4) using quaternion products yields two con¬ 
straints: a scalar equation and a vector equation: 

aoxo — ax = zobo — b ■ z (5) 

ao * + xo a + a x x = zob + boz — bxz (6) 

where • and x denote the dot-product and the vector product 
in the space of 3-vectors. 

If a o ^ 0, xo can be solved from (5): 

xo = ( a/a 0 ) • x + (bo/a 0 ) z 0 — {b/a 0 ) ■ z (7) 


By substitution of eq. (7) into eq. (6) and using the matrix 
representation to describe the vector and dot products yields: 

(a 0 I + a a T /a o + 11(a)) x+ 

{—bol — ab T /a 0 + 11(6)) z = z 0 b — z 0 ( bo/a 0 )a 

where 0(a) is the skew-symmetric matrix associated with the 
3-vector a. 

Therefore, we obtain (with zo ^ 0): 

= zo (b — (bo/ao) a) (8) 

3x6 6x1 

where u T = \x T , z T ]. 

Equation (8) consists of three linear constraints with six un¬ 
knowns. Therefore, a unique solution for u requires multiple 
measurements. 

The solution of u can be obtained using standard linear al¬ 
gebra techniques. After u is obtained, the components of both 
q x and q z can be determined using the constraints Hq^H 2 = 
[|q z || 2 = 1 and eq. (7). 

Following the solution of Rjv and Rz, the computation of 
tx and tz becomes trivial. Each position of the hand provides 
three linear equations with six unknowns (the components of 
tx and tz). 

III. Problem solution 

In this section we propose two alternatives for estimating Rjc , 
Rz , tx , and tz'- A closed-form method and a non-linear method 
which do not suffer from the above limitations, e.g., a o ^ 0 and 
zo ^ 0. 

The closed-form method uses algebraic properties associated 
with quaternions to cast a sum of squares error function into a 
positive semi-definite quadratic form whose minimization uses 
two Lagrange multipliers. The non-linear method solves for 
all the unknowns simultaneously using standard minimization 
techniques. Interesting enough, the closed-form method is sim¬ 
ilar but not equivalent to the problem of optimally estimating 
rigid motion from 3-D to 3-D point or line correspondences [8], 
[9]. The method introduced in this paper solves simultaneously 
for two rotations in closed form while the methods developed in 
the past solved for one rotation in closed form. 

A. Closed-form method 

We start by building a positive error function that is derived 
from equation (4) as follows. Since the quaternion multipli¬ 
cation can be written in matrix form and with the notations 
introduced in [8] we have: 

9a» * Qx = Q{QAi)Qx 
Qz * Qbi = W{q Bi )q z 

By substituting these equations into (4), we obtain: 

Q(QAi)Qx ~W{q Bi )q z =0 



With matrices Q(q) and W{q) being defined by: 


By developing and grouping terms we obtain: 
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Q{q) = 


W(q) = 


Moreover, these two matrices are orthogonal and for a unit 
quaternion q we have: 

Q{q) T Q{q) = q T qi = i 

W{q) T W{q) = q T ql = I 

The square norm of the corresponding error vector is given 
by the following positive quadratic form: 

II Q{QAi)Qx ~ W(q B i) qz \\ 2 = 

[Q(qAi)qx - W(q Bi )q z ] T [Q[q A i)qx - W(q Bi )q z ] = 
QxQ(QAi) T Q(QAi)Qx +<lzW{q Bi ) T W{q Bi )q z - 
<llW(q Bi ) T Q(q Ai )q x - qxQ(QAi) T W(q Bi )q z 

Let v be an 8-vector given by: 

T r T T1 

« = [lx, Qz\ 

Thus, we can write: 


f{Qx,Qz) = (n-Xi)q T x q x + (n - A 2 )q T z q z + 

Qx^Qz~^Qz^ Qx + Ai + A 2 (11) 

This function passes through a minimum when the first deriva- 
tives vanish. By differentiating with respect to the components 
of q x and q z we obtain: 

(n - \i)q x + Cq z = 0 (12) 

( n - A 2 )q z + C T q x = 0 (13) 

From equation (12) we obtain: 

Qx = A 7=^ Cqz (14) 

and by substituting q x in equation (13) we obtain: 

C T Cq z = (Ai — n)(A 2 — n)q z (15) 

Therefore q z is an eigenvector of the symmetric positive semi- 
definite matrix C r C. Such a matrix has four real positive eigen¬ 
values ai, i = {1...4} and we have an eigenvector a for each 
eigenvalue: 

C — a\G>i 

Notice that by substituting equations (14) and (15) into equa- 
tion (11) we obtain the value of the error function at the point 
where the first derivatives vanish: 


II QiQAi)Qx ~W{q Bi )q z \\ 2 =v T S t v 
with S,: being an 8x8 positive semi-definite symmetric matrix: 


f(Qx,Qz) = Ai+A 2 

Therefore, we must choose an eigenvalue which minimizes 
Ai + A 2 . Let us consider the fact that q x must be a unit quater¬ 
nion. We obtain from equations (12) and (15): 


where Cj = — Q(qAi) T ^(q Bi ) is an orthogonal matrix of rank 
equal to 4. 

Finally, the error function that will allow us to compute q x 
and q z becomes (n is the number of different positions of the 
robot): 

n / n \ 

f(Qx,Qz) = iv = v T ( ^S,: J v = v T Sv (10) 


(Ai - n) 2 


q T z C T Cq z = 


(Ai — n) 2 

Hence, we must have: 


q z ( Ai — n)(A 2 — n)q z = 


A 2 — n 
Ai — n 


Ai = A 2 ^ 0 


Notice that C = C» is the sum of n orthogonal matrices. 

In the general case C has full rank and there may be geometric 
configurations for which C is rank deficient. However, such geo¬ 
metric configurations are very rare in practice and, without loss 
of generality, one may assume that C has always full rank. The 
function f{q x ,q z ) is a positive semi-definite quadratic form 
and one way to minimize it is to use two Lagrange multipliers: 


mj 11 / = {{Qx Qz) S (q x q z ) + 

U H XZ 

Ai (1 — qxQx) + a 2 (1 — q z Qz)) 


The relationship between Ai = A 2 = A and ai, i.e., equation (15) 
is: 

(A — n) 2 = Qj 

which yields the following solutions for A: 

A = n ± yfai 

Since A must be a positive number, one has to select among the 
four positive eigenvalues, the eigenvalue ai such that n ± [ai 
is the smallest positive number. 

Once the rotations, Rjv and R z, have been determined, the 
problem of determining the best translations, tx and tz, be¬ 
comes a linear least-squares problem that can be easily solved 
using standard linear algebra techniques. 



A.l. Configurations defeating the linear method. There are 
two configurations for which the linear method fails to provide 
a solution: zo = 0 and ao = 0 (see Section II-A). Clearly the 
closed-form solution is able to deal with situations for which 
z o = 0. The case ao = 0 is a little bit more complex to analyze. 
First, notice that the 4x4 matrices Q(q) and W (q) have full 
rank for all non null quaternions q. Let, for some i, q Ai = 
[0, af\ T . Q(q Ai ) becomes a skew-symmetric matrix of full rank 
for all a,: ^ 0. Hence, the rank of Sj in equation (9) is not 
affected by such a special case. However there is an ambiguity 
associated with purely imaginary unit quaternions because the 
quaternions q Ai = [0 ,af] T and q Ai = [0, — af] T describe the 
same rotation matrix Ra*. Hence, one has two consider two 
distinct matrices associated with this special configuration: 



r t c, ' 


I -C< 

S t = 

_ 1 

and Sj = 

. -Cf I 


Therefore, any time such a special configuration is present in the 
data, one has two consider two distinct error functions. There 
will be two two distinct solutions for q x and q z . One may sim¬ 
ply consider, among these two solutions, the solution yielding 
the smallest minimum. 

B. Non-linear method 

There are several disadvantages associated with the above 
methods: 

1. The unknowns are estimated in sequence, rotations first 
and then translations. Errors from the first stage propa¬ 
gate to the second stage; 

2. It is well known that the performance of linear resolution 
methods degrades in the presence of noise, and 

3. Unlike non-linear minimization, linear and closed-form so¬ 
lutions do not allow a characterization of both the quality 
of the solution and the confidence associated with the so¬ 
lution. 

In this Section, we propose to overcome the disadvantages 
mentioned above. For this purpose we estimate simultaneously 
the rotations and translations associated with X and Z. This 
leads to a non-linear minimization problem. There are 24 par 
rameters associated with two rotation matrices (18 parameters) 
and two translation vectors (6 parameters). The initialization of 
these unknowns is straightforward because one can use either of 
the two methods outlined above. Non-linear minimization pro¬ 
vides information about both the quality of the solution (the 
depth of the global minimum) and the confidence associated 
with this solution (the width of the global minimum). 

If we have n positions of the robot, the calibration problem 
becomes the problem of solving for a set of 2 n non-linear con¬ 
straints derived from equations (2) and (3), or equivalently, the 
problem of minimizing the following error function: 

/(Rjv, Rz, tx, tz) = 

n 

/n (||R.Ai Rx — Rsi Rz|| 2 ) + 

i=i 
n 

/l2 (||Ra* tx + tAi ~ Rz ts* — tz|| 2 ) + 

i=l 

/i 3 ||RxRx - III 2 + in ||RzR| - I || 2 


The criterion to be minimized is of the form: 

imn|/(x) = ■ * £ IR 24 j 

Therefore, the problem becomes a classical non-linear least- 
squares constrained minimization problem and one can apply 
standard non-linear optimization techniques, such as Newton’s 
method and Newton-like methods [10], [11]. In this error func¬ 
tion, the terms 4?^ are quadratic in the unknowns. Notice that 
the last two terms are penalty functions which constrain the mar 
trices Rjv and Rz to be rotations. The parameters //1 through 
/t 4 are real positive numbers. High values for /x 3 and \i 4 enforce 
the role of the penalty functions In all our experiments we have 
set these parameters to the following values: /ii = P 2 = 1 and 
/i 3 = /i 4 = 10 6 In the next two sections we give some results ob¬ 
tained with the Levenberg-Marquardt non-linear minimization 
method as described in [ 12 ] and in [ 11 ]. 

IV. Sensitivity analysis and method comparison 

One of the most important merits of any calibration method 
is its sensitivity with respect to various perturbations. In our 
problem, there are two main sources of perturbations: errors 
associated with camera calibration and errors associated with 
the robot position. Indeed, the parameters of both the direct 
and inverse kinematic models of robots are not perfect. In order 
to investigate the behavior of the methods in the presence of 
measurement noise we designed a sensitivity analysis based on 
the following grounds: 

• Nominal values for the parameters of both the hand-eye 
transformation X and the robot-to-world transformation 
Z are provided; 

• Also are provided n matrices Ai, ... A„ from which n hand 
positions can be computed with: 

Bj = Z -1 AjX 

• Either Gaussian noise or uniform noise is added to both 
camera and robot positions; the homogeneous transforma¬ 
tions, (X and Z), are estimated in the presence of this 
noise using the three methods described in this paper: the 
linear method, the closed-form method and the non-linear 
method, and 

• We study the variations of the estimation of the hand-eye 
transformation and the robot-to-world transformation as a 
function of the noise being added and/or as a function of 
the number of positions (n). 

Since both rotations and translations may be represented as 
vectors, adding noise to a transformation consists of adding ran¬ 
dom numbers to each one of the vectors’ components. Random 
numbers simulating noise are obtained using a random num¬ 
ber generator either with a uniform distribution in the interval 
[—C/2, +C/2], or with a Gaussian distribution with a standard 
deviation equal to a. Therefore the level of noise that is added 
is associated either with the value of C or with the value of a 
(or, more precisely, with the value of 2<r). In what follows the 
level of noise is in fact represented as a ratio: the amplitude of 
the actual random numbers (C or 2<r) divided by the nominal 
values of the perturbed parameters. 

In the case of a rotation, the vector (quaternion) associated 
with this rotation has a module equal to 1 and therefore the 



ratio is simply either C or 2<r. In the case of a translation the 
ratio is computed with respect to a nominal value estimated 
over all the perturbed translations: 


11 ^nominal | — 


EHiai^ii + ii^ii) 


2n 


where tA; is the translation vector associated with A*. 

For each noise level and for a large number N of trials we 
compute the errors as follows. These errors are: orientation er¬ 
ror and position error. The orientation error is defined as the 
rotation angle in degrees required to align the coordinate sys¬ 
tem of X or Z in its computed orientation with the coordinate 
system in its theoretical orientation. The position error is de¬ 
fined as the norm of the vector which represents the difference 
between the two translation vectors: the computed one and the 
theoretical one, divided by the norm of the second vector. 

In all our simulations we set N = 500, ||ix|| = 229mm, and 
\\tz\\ = 768mm. 

The following figures show the average of the above errors as 
a function of the percentage of noise. The percentage of noise 
varies from 1% to 6%. The full curves (—) correspond to the 
method in [1], the dotted curves (. ..) correspond to the closed- 

form method, and the dashed curves (-) correspond to the 

non-linear method. 


Figures 3 and 4 correspond to three positions (n = 3) of the 
robot while on Figure 5 the number of positions varies from 3 
to 8. 


Figure 3 shows the rotation and translation errors as a func¬ 
tion of uniform noise added to the rotational part of the robot 
and camera positions. Figure 4 shows the rotation and trans¬ 
lation errors as a function of Gaussian noise added to the ro¬ 
tational part of the hand and camera positions. These errors 
are obtained with the three methods. We can conclude that the 
closed-form method is more accurate than the linear method 
proposed in [1]. 

As other authors have done in the past, it is interesting to 
analyze the behavior of calibration methods with respect to the 
number of positions. In order to perform this analysis we have 
to fix the percentage of noise. Figure 5 shows the rotational 
and translational errors as a function of the square root of the 
number of motions (i/n varies from 1.732 to 2.828). The noise 
ratio has been fixed to the worst case for rotations, e.g., 6% and 
to 2% for translations. Both rotational and translational noise 
distributions are Gaussian. 



Noise Level 


(a) Orientation errors. 



Noise Level 


(b) Relative position errors. 

Fig. 3. Errors in orientations and positions in the presence of uniform 
noise perturbing the rotation axes. The full curves (—) correspond 
to the method outlined in section II and the dashed curves (— —) 
correspond to the non-linear method. 


The third columns show the relative error in translation, Ef. 


{ ^2 ||(R.Atx + tA — R ztB — tz 
V £l|R.4fx+fA|| 2 


1/2 


V. Experimental results 

In this Section we report some experimental results obtained 
with two sets of data. The first data set was obtained with 
17 different positions of the hand-eye device with respect to a 
calibrating object. The second data set was obtained with 7 
such positions. In order to calibrate the camera we used the 
classical method proposed by Faugeras & Toscani described in 
[9], 

Our tests compare the linear method [1] with the two methods 
developed in this paper. Table I and Table II summarize the 
results obtained with the two data sets mentioned above. The 
second columns of these tables show the sum of squares of the 
absolute error in rotation, E 

||RaRjt — RzRb|| 2 


It is worthwhile to notice that the robots being used in the 
two experiments summarized in the tables above are not identi¬ 
cal. The first data set (Table I) was obtained with a PPPRRR 
robot (three prismatic and three rotational joints) while the sec¬ 
ond data set (Table II) was obtained with a RRRRRR robot. 
Unlike the simulated data, these two experiments do not allow 
one to conclude that the closed-form solution outperforms the 
linear solution. In the first experiment the linear solution yields 
a smaller translation error than the translation error associated 
with the closed-form method. In the second experiment the 
translation error associated with the linear method does not 
seem to be affected by a large rotation error. 

These experimental results seem however to confirm that the 
non-linear method provides a better estimation of the transla- 
tion vectors at the cost of slightly larger rotation errors. This 
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(a) Orientation errors. 



Noise Level 

(b) Relative position errors. 

Fig. 4. Errors in orientations and positions in the presence of Gaussian 
noise perturbing the rotation axes. The full curves (—) correspond to 
the method outlined in section II, the dotted curves (...) correspond 

to the closed-form solution and the dashed curves (-) correspond 

to the non-linear method. 

is due to the fact that the robot’s kinematic chain is not per¬ 
fectly calibrated and therefore there are errors associated with 
the robot’s translation parameters. Obviously, these errors do 
not obey the noise models used for simulations. 

VI. Discussion 

In this paper we addressed the problem of robot-to-world and 
hand-eye calibration. As it was proposed in [1] this problem is 
formulated as solving a system of homogeneous transformation 
equations of the form AX = ZB. 

We develop two resolution methods, the first one solves for 
rotations and then for translations while the second one solves 
simultaneously for rotations and translations. The first method 
leads to a closed-form solution while the second one leads to 
non-linear optimization. 

Both the sensitivity analysis and the results obtained with 
experimental data show that the closed-form method slightly 
outperforms the linear method of Zhuang et al. [1]. This is 



Square root of the Number of Poses 


(a) Orientation errors. 
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(b) Relative position errors. 

Fig. 5. Errors in orientations and positions as a function of the number 
of positions. A Gaussian noise is added both to the robot and camera 
positions. The full curves (—) correspond to the method outlined in 

section II and the dashed curves (-) correspond to the non-linear 

method. 

most probably due to the Euclidean nature of the error func¬ 
tion suggested in Section III-A. However, there is no evidence 
that with real data the closed-form method will always perform 
better than the linear method: One can therefore conclude that 
the two methods have comparable performances. 

The non-linear minimization method suggested in Section III- 
B yields the most accurate results and outperforms both the 
linear and closed-form methods. The solution obtained with 
either the linear or closed-form methods can be used to initialize 
the non-linear minimization method. 

The two methods proposed in this paper together with [1] 
may be useful to other problems that can be formulated into 
homogeneous transformation equations of the form AX = ZB. 
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